A Technical Guide to Concatenative Speech Synthesis for Hindi using Festival
Abstract
Speech is the most natural and effective medium of communication among human beings, and it has played a major role in the evolution of human civilization. Speech synthesis is the artificial production of speech. Native speakers of a language draw on their knowledge of various prosodic features during speech production. They acquire these features unconsciously in childhood and use them to convey the meaning of an utterance as well as emotional states. Bringing similar naturalness to artificial speech production (speech synthesis) remains a challenging task. This paper describes in detail how to develop a speech synthesizer using the Festival tool. Two approaches are discussed in detail for Hindi: limited domain synthesis and unit selection synthesis. The paper also explains how to configure Festival on Linux so that one can start working on a text-to-speech (TTS) system. Its purpose is to give novice speech researchers full insight into the technicalities involved in adapting Festival to a new language.
General Terms: Speech Synthesis, Unit selection method, Grapheme to Phoneme, Formant
Related resources
Text To Speech for Bangla Language using Festival
In this paper, we present a Text to Speech (TTS) synthesis system for the Bangla language using the open-source Festival TTS engine. Festival is a complete TTS synthesis system, with components supporting front-end processing of the input text, language modeling, and speech synthesis using its signal processing module. The Bangla TTS system proposed here creates the voice data for Festival, and add...
Design of English to Hindi Corpus Based Text Conversion and Hindi Text to Speech Synthesis
English is a global language but is understood by only a small percentage of the population in India. It continues to remain a barrier for the rural population to learn and compete at a global level. Machine translation helps people from different places to understand an unknown language without the aid of a human translator. A Text to Speech system generates speech from text given as input. The proposed system wi...
Diphone-Based Concatenative Speech Synthesis System for Mongolian
This paper describes the first Text-to-Speech (TTS) system for the Mongolian language, using the general speech synthesis architecture of Festival. The TTS is based on diphone concatenative synthesis, applying TD-PSOLA technique. The conversion process from input text into acoustic waveform is performed in a number of steps consisting of functional components. Procedures and functions for the s...
Indian Language Screen Readers and Syllable Based Festival Text-to-Speech Synthesis System
This paper describes the integration of commonly used screen readers, namely, NVDA [NVDA 2011] and ORCA [ORCA 2011] with Text to Speech (TTS) systems for Indian languages. A participatory design approach was followed in the development of the integrated system to ensure that the expectations of visually challenged people are met. Given that India is a multilingual country (22 official languages...
Festival 2 - build your own general purpose unit selection speech synthesiser
This paper describes version 2 of the Festival speech synthesis system. Festival 2 provides a development environment for concatenative speech synthesis, and now includes a general purpose unit selection speech synthesis engine. We discuss various aspects of unit selection speech synthesis, focusing on the research issues that relate to voice design and the automation of the voice development p...